Amazon EMR Reviews & Ratings 2024

Overview

What is Amazon EMR?

Recent Reviews

Amazon EMR is ideal for Hadoop-based processing

8 out of 10

April 22, 2022

The AWS stack is a big component of the majority of our work. When necessary, EMR is employed in a number of these settings. When we need …

Amazon EMR - fast, and elastic

7 out of 10

April 19, 2022

Incentivized

We use Amazon EMR (Elastic MapReduce) to run various types of algorithms related to health like calculation of body mass index, heart rate …

Amazon EMR is worth it when you have experience

8 out of 10

April 11, 2022

Incentivized

For some clients, we have our product hosted on several AWS products, and when it comes to retrieving big volumes of data we use the …

EMR: Great Services for Analytics

10 out of 10

April 09, 2022

Incentivized

On request transitory clusters for huge information handling. I like its accessibility completely different taken a toll tire makes it …

Good product to try with...Amazon EMR.

9 out of 10

April 08, 2022

Incentivized

Having our product hosted on various AWS EC2 instances for some clients and when it requires pulling large amounts of data and performing …

Amazon EMR - It's very intuitive and easy to use.

10 out of 10

April 08, 2022

Incentivized

We migrated the entire hadoop structure to Amazon EMR, the cost and maintenance are much better compared to other solutions on the market. …

My chosen scaleable cloud platform

9 out of 10

April 06, 2022

Incentivized

I use Amazon EMR (Elastic Map Reduce) as a scalable platform to deploy my client solutions onto. It allows me to scale our solution …

AWS has it all!

7 out of 10

April 06, 2022

Incentivized

To keep my review simple it is very convenient that AWS has a MapReduce tool as it was easy to deploy and test with our cloud setup. Also …

Awesome Services to process data seamlessly

7 out of 10

April 06, 2022

Incentivized

We use it to process our data for real-time prediction for one of our AI models. This has tremendously reduced our effort and time. We …

Amazon EMR is the extremely large datasets transformation solution

8 out of 10

December 10, 2021

Incentivized

Most commonly we use Amazon ec2 for daily work on the cloud but when it comes to pulling large amounts of data and performing large …

Amazon EMR - Truly elastic

10 out of 10

December 03, 2021

Incentivized

Used as spark cluster to enable Big data ETL processes. Analysists and data scientists uses clusters for adhoc querying purposes. Raw data …

Amazon EMR Review

7 out of 10

September 22, 2020

Incentivized

Amazon EMR is being used by our organization to simplify running big data frameworks, and provide the Amazon EMR highlights, product …

AWS's EMR: the best option out there

10 out of 10

September 20, 2020

Incentivized

We are a certified AWS partner agency, and we use a lot of the AWS stack for most of our projects. EMR is used in several of them when …

Amazon EMR is a great way to manage and analyze data!

8 out of 10

September 11, 2020

Incentivized

Amazon Eliastic MapReduce may be a mouthful (EMR is much easier to say) but like taking that string and reducing it to its acronym, it …

Amazon Elastic MapReduce is Worth the Effort if You Really Need Big Data Management, Just Make Sure You Really Need It

8 out of 10

February 04, 2019

Incentivized

Amazon Elastic MapReduce is used by my department to produce big data analytics for certain clients. The software address data mining and …

Read all reviews

Awards

Products that are considered exceptional by their customers based on a variety of criteria win TrustRadius awards. Learn more about the types of TrustRadius awards to make the best purchase decision. More about TrustRadius Awards

Return to navigation

Product Details

About
Tech Details
FAQs

What is Amazon EMR?

Amazon EMR Technical Details

Operating Systems	Unspecified
Mobile Application	No

Frequently Asked Questions

Amazon EMR is a cloud-native big data platform for processing vast amounts of data quickly, at scale. Using open source tools such as Apache Spark, Apache Hive, Apache HBase, Apache Flink, Apache Hudi (Incubating), and Presto, coupled with the scalability of Amazon EC2 and scalable storage of Amazon S3, EMR gives analytical teams the engines and elasticity to run Petabyte-scale analysis.

Reviewers rate Support Rating highest, with a score of 9.

The most common users of Amazon EMR are from Small Businesses (1-50 employees).

Return to navigation

Comparisons

View all alternatives

Compare with

Reviews and Ratings

(60)

Attribute Ratings

Reviews

(1-5 of 5)

Sort By *

Companies can't remove reviews or game the system. Here's why

April 22, 2022

Amazon EMR is ideal for Hadoop-based processing

Verified User

Engineer in Engineering

Computer Software Company, 201-500 employees

Score 8 out of 10

Vetted Review

Verified User

Incentivized

Use Cases and Deployment Scope

The AWS stack is a big component of the majority of our work. When necessary, EMR is employed in a number of these settings. When we need to process a large amount of data across several EC2 servers, our DevOps team implements it. For our customers, EMR is attractive since it is far less expensive to adopt than alternative solutions, which means that the overall cost savings are substantial.

Pros and Cons

Faster than prior on-premise systems to put in place.
Open source software is supported.
Reduces the cost of production.

Automation of processing jobs creation and deletion.
The cost of this service is more expensive than similar ones.
Getting everything up and running at the beginning is a lengthy process.

Likelihood to Recommend

You can use Amazon EMR if you wish to shift to the cloud and save money by using Apache Spark or Apache Hadoop on-premises. When the amount of work you have to handle data fluctuates a lot. Setting up flexible and scalable scenarios with AWS's EMR can assist you.

Most Important Features

Compatibility between Spark and Hive workloads in Hadoop.
EC2 Cluster Monitoring and Logging is at least five times slower than our data processing.

Return on Investment

Cost reductions for consumers are enormous as compared to other (more conventional) choices, particularly on-premise ones.
S3 and EC2 API connection allowed for increased scalability.

Alternatives Considered

Snowflake

Snowflake is a lot easier to get started with than the other options. Snowflake's data lake building capabilities are far more powerful. Although Amazon EMR isn't our first pick, we've had an excellent experience with EC2 and S3. Because of our current API interfaces, it made more sense for us to continue with Hadoop rather than explore other options.

Other Software Used

Amazon Redshift, Amazon EC2 Auto Scaling

April 08, 2022

Amazon EMR - It's very intuitive and easy to use.

Verified User

Analyst in Information Technology

Information Technology & Services Company, 1-10 employees

Score 10 out of 10

Vetted Review

Verified User

Incentivized

Use Cases and Deployment Scope

We migrated the entire hadoop structure to Amazon EMR, the cost and maintenance are much better compared to other solutions on the market. We created a recommender system filter in big data. We needed a low runtime to meet our demand and we were able to get through the Amazon EMR.We migrated the entire hadoop structure to Amazon EMR, the cost and maintenance are much better compared to other solutions on the market. We have a lot of data science tasks, like calculating statistics between various math calculations to apply the business rules. Definitely one of the best services to work on bigdata.

Pros and Cons

Faster processing.
The distributed computation of the calculations.
Easy to setup.
Monitoring as an add up.
Can be integrated with lots of technologies.

Overhead delay in starting clusters which can cause problems.

Likelihood to Recommend

It provides a nice graphical user interface to manage and work with big data map reduction tasks instead of manual configuration with hadoop or cli.it saves a lot of time and effort.We create big data monitoring system filters.

It provides a good GUI to manage and handle big data map reduction tasks and its configuration saves a lot of time and effort.

Most Important Features

Configuration simplify
Spark

Return on Investment

We use Hadoop and Spark frameworks. To process and store big data

Alternatives Considered

Good choice for startup, open source and cost-effective and saves a lot of setup time.
Run times are reduced to minutes compared to hours on EC2 or other compute servers.
Easy to choose between hadoop or spark based EMR cluster, it can be used in combination with other AWS services, for example we can create budgets involving EMR and various other tasks in AWS data pipeline service.

Other Software Used

Azure DevOps Server (formerly Team Foundation Server), AWS Lambda, Amazon Elastic Compute Cloud (EC2)

September 20, 2020

AWS's EMR: the best option out there

Nicolas Costa Ossa

Director of Customer Operations & Account Management

Koombea (Internet, 51-200 employees)

Score 10 out of 10

Vetted Review

Verified User

Incentivized

Use Cases and Deployment Scope

We are a certified AWS partner agency, and we use a lot of the AWS stack for most of our projects. EMR is used in several of them when required. It is implemented by our DevOps team and we pretty much use it when we need to process a lot of data throughout EC2 instances. EMR is very compelling to our customers because it is easier to implement (hence less dev cost) and it is way more efficient when managing the data VS other tools, so the overall cost reduction is considerable.

Pros and Cons

Easier to implement than older on-premise solutions
Works with open source technologies.
Keeps processing cost low.
It is flexible and works also for short term workloads and the pricing changes to that model.

You definitely need to be trained before using it because the interface can be a little confusing. It is a professional service model, so I recommend a certified dev.

Likelihood to Recommend

For example, when you have Apache Spark on-premise deployments, or also Apache Hadoop, and you want to move to the cloud and reduce costs, EMR is the right tool. When you have lots of ups and downs in workload levels to process data. AWS's EMR can help you by setting up flexible/scalable scenarios.

Return on Investment

As an agency, we have won new customers just by having these EMR capabilities as a service.
For the customers, the savings on costs are huge when compared with ore (more traditional) options, especially with on-premise ones.

Alternatives Considered

Apache Spark and Databricks Unified Analytics Platform

EMR is more suited for developers. Databricks feel more for data science-oriented with its notebooks and customs visualizations. With EMR you can more easily add additional capacity on-damnd on the instance. With others is a more cumbersome process. And then, you can also configure it to be dynamic and change depending on your usual flow of data.

Support Rating

There's a vast group of trained and certified (by AWS) professionals ready to work for anyone that needs to implement, configure or fix EMR. There's also a great amount of documentation that is accessible to anyone who's trying to learn this. And there's also always the help of AWS itself. They have people ready to help you analyze your needs and then make a recommendation.

Usability

Experience and time using the platform play a huge portion in the overall usability experience. It is a complex and powerful tool, so training is necessary.

October 25, 2017

AWS EMR at a glance!!

Verified User

Engineer in Information Technology

Automotive Company, 1001-5000 employees

Score 7 out of 10

Vetted Review

Verified User

Incentivized

Use Cases and Deployment Scope

We have used AWS EMR before starting to use Databricks on EC2 instances. EMR was solving the problem but we needed a better solution (Enterprise edition) to manage our Workbooks and better scheduler for running or jobs. EMR was working fine but we did not find it user friendly to add the data nodes on demand. We used EMR primarily to process the data on AWS S3 using Hadoop and Spark frameworks. We have also used AWS SWF to orchestrate our job flow by adding steps. It was used widely by the data processing team and not by the entire organization as most of the data was on local servers. It addresses problems like processing data which might not need to be processed live as the cluster can be spun up and shut down once the job is completed. It is cost efficient (especially if you do not need data nodes and only task nodes), scalable and reliable.

Pros and Cons

EMR does well in managing the cost as it uses the task node cores to process the data and these instances are cheaper when the data is stored on s3. It is really cost efficient. No need to maintain any libraries to connect to AWS resources.
EMR is highly available, secure and easy to launch. No much hassle in launching the cluster (Simple and easy).
EMR manages the big data frameworks which the developer need not worry (no need to maintain the memory and framework settings) about the framework settings. It's all setup on launch time. The bootstrapping feature is great.

Sometimes bootstrapping certain tools comes with debugging costs. The tools provided by some of the enterprise editions are great compared to EMR.
Like some of the enterprise editions EMR does not provide on premises options.
No UI client for saving the workbooks or code snippets. Everything has to go through submitting process. Not really convenient for tracking the job as well.

Likelihood to Recommend

EMR is suited if the jobs are long running and doesn't really need much monitoring. EMR is really flexible in processing the data on s3 as a developer doesn't need to spend time on debugging the connections to s3 from a big data framework as most of the configuration is taken care of by Amazon. Very cheap when compared to most of the solutions on the market and the ready to go configuration at the launch time reduces the amount of time required for admin tasks. So, considering the cheap cost, processing options on s3 and scalability via adding task nodes, EMR serves a better purpose for startups considering open source and cost efficient options.

However, EMR comes with its own disadvantages. There is no proper UI to track real time jobs which is however possible with Enterprise editions like Cloudera, Hortonworks etc. EMR could provide an interface to add workbooks and code snippets in the cluster as it would reduce the time to submit the tasks. EMR also lags the potential to automatically replace unhealthy nodes.

Return on Investment

It was obviously cheaper and convenient to use as most of our data processing and pipelines are on AWS. It was fast and readily available with a click and that saved a ton of time rather than having to figure out the down time of the cluster if its on premises.
It saved time on processing chunks of big data which had to be processed in short period with minimal costs. EMR solved this as the cluster setup time and processing was simple, easy, cheap and fast.
It had a negative impact as it was very difficult in submitting the test jobs as it lags a UI to submit spark code snippets.

Alternatives Considered

Databricks and Hortonworks Data Platform

Having one of these enterprise edition license comes at its own costs. But, the flexibility to have the cluster spin up with the workbenches and code snippets on the same is really beneficial. Especially, if one had to move out of EMR and consider an option which reduces the debugging time in establishing connections to AWS resources, I would love to used the mentioned three resources on EC2. This would definitely make the processing time to reduce as there is a flexibility to test real time and execute the code snippet and look at the performance and monitor the snippet in real time.

Other Software Used

Databricks, Amazon Elastic Compute Cloud (EC2), Amazon DynamoDB, Amazon S3 (Simple Storage Service), Amazon Aurora, Amazon Redshift, Amazon CloudFront, Amazon CloudWatch

June 22, 2016

Research Experience with Amazon EMR

Verified User

Analyst in Research & Development

Research Company, 1-10 employees

Score 6 out of 10

Vetted Review

Verified User

Incentivized

Use Cases and Deployment Scope

As a PhD student, I used Amazon Elastic MapReduce for my research for analyzing my data. Firstly, it was very scalable and did not cause much performance impact when using large data sets. Secondly, their web console is very easy to use and intuitive. There were many resources that could be used whenever I encountered any problems with EMR.

Pros and Cons

The cluster size of MapReduce is very dynamic and therefore scalability is good for EMR.
It also works well with other Amazon Web Services like Amazon Simple Storage Service, which means that data can be taken from those services and written back to them.
I tried using the in-house hosting at the university I work in, but there would be a lot of complications with technical support required. For Amazon, the support and documentation was good to solve these problems faster.

It would have been better if packages like HBase and Flume were available with Amazon EMR. This would make the product even more helpful in some cases.
Products like Cloudera provide the options to move the whole deployment into a dedicated server and use it at our discretion. This would have been a good option if available with EMR.
If EMR gave the option to be used with any choice of cloud provider, it would have helped instead of having to move the data from another cloud service to S3.

Likelihood to Recommend

If the person using EMR does not need much customization, like debugging or other modifications, or the data is not entirely in another cloud, then Amazon Elastic MapReduce is a better option. Otherwise, there are other open source projects available like Cloudera that are available to be used. Products like Cloudera can also be deployed in any cloud, rather than having to stick with Amazon.

Return on Investment

Positive: Helped process the jobs amazingly fast.
Positive: Did not have to spend much time to learn the system, therefore, saving valuable research time.
Negative: Not flexible for some scenarios, like when some plugins are required, or when the project has to be moved in-house.

Alternatives Considered

Cloudera

EMR provides dynamic cluster size, lots of documentation, and integration with other Amazon Web Services which are some of the things that Cloudera distribution for Hadoop lacked. Some products are hard to learn but EMR was much easier and helped save time spent on trying to figure out how to deploy projects in MapReduce.

Other Software Used

Cloudera Enterprise

Return to navigation

Amazon Kinesis

Google BigQuery

Microsoft Azure

Hadoop

Apache Spark

IBM Analytics Engine

Azure HDInsight

Hortonworks Data Platform

Apache Pig

HPE Ezmeral Data Fabric (MapR)

Db2 Big SQL

Cloudera Manager